(57) Abstract

X-ray crystallography can be used to screen compounds that are not known ligands of a target biomolecule for their ability to bind the
target biomolecule. The method includes obtaining a crystal of a target biomolecule; exposing the target biomolecule crystal to one or more
test samples; and obtaining an X-ray crystal diffraction pattern to determine whether a ligand/receptor complex is formed. The target is
exposed to the test samples by either co-crystallizing a biomolecule in the presence of one or more test samples or soaking the biomolecule
crystal in a solution of one or more test samples. In another embodiment, structural information from ligand/receptor complexes is used
to design ligands that bind tighter, that bind more specifically, that have better biological activity or that have a better safety profile. A
further embodiment of the invention comprises identifying or designing biologically-active moieties by the instant process. In a further
embodiment, a biomolecule crystal having an easily accessible active site is formed by co-crystallizing the biomolecule with a degradable
ligand and degrading the ligand.

Ligand Screening and Design by X-ray Crystallography

Technical Field of the Invention

X-ray crystallography is useful for identifying ligands that bind target receptor
molecules and for designing ligands with improved biological activity for the target
receptor.

Background of the Invention 

X-ray crystallography (crystallography) is an established, well-studied technique
that provides what can best be described as a three-dimensional picture of what a molecule
looks like in a crystal. Scientists have used crystallography to solve the crystal structures
of many biologically important molecules. Many classes of biomolecules can be studied
by crystallography, including, but not limited to, proteins, DNA, RNA and viruses.
Scientists have even reported the crystal structures of biomolecules that carry ligands
within their receptors (a "ligand-receptor complex").

Given a "picture" of a target biomolecule or a ligand-receptor complex, scientists
can look for pockets or receptors where biological activity can take place. Then scientists
can experimentally or computationally design high-affinity ligands (or drugs) for the
receptors. Computational methods have alternatively been used to screen for the binding
of small molecules. However, these previous attempts have met with limited success.
Several problems plague ligand design by computational methods. Computational
methods are based on estimates rather than exact determinations of the binding energies,
and rely on simple calculations when compared with the complex interactions that exist
within a biomolecule. Moreover, computational models require experimental
confirmation, which often exposes the predicted ligands as false positives that do not work
on the real target.

Moreover, experimental high-affinity ligand design based on a "picture" of the
ligand-receptor complex has been limited to biomolecules that already have known
ligands. Finally, scientists only recently reported the crystallographic study of interactions
between organic solvents and target biomolecules. Allen et al., J. Phys. Chem., v. 100, pp.
2605-11 (1996). However, these studies are limited to mapping solvent sites rather than
ligand sites. It would be desirable to directly identify potential ligands, and to obtain
detailed information on how the ligand binds and on changes in the target biomolecule. In
addition, methods for identifying and/or designing ligands which possess biological and/or
pharmaceutical activity with respect to a given target molecule would be desirable.

Brief Summary of the Invention 

Crystallography can be used to screen and identify compounds that are not known
ligands of a target biomolecule for their ability to bind the target. The method (hereinafter
"CrystaLEAD™") comprises obtaining a crystal of a target biomolecule; exposing the
target to one or more test samples that are potential ligands of the target; and determining
whether a ligand-biomolecule complex is formed. The target is exposed to potential
ligands by various methods, including but not limited to, soaking a crystal in a solution of
one or more potential ligands or co-crystallizing a biomolecule in the presence of one or
more potential ligands.

In a further embodiment, structural information from the ligand/receptor
complexes found is used to design new ligands that bind tighter, bind more specifically,
have better biological activity or have a better safety profile than known ligands.

In a preferred embodiment, libraries of "shape-diverse" compounds are used to
allow direct identification of the ligand-receptor complex even when the ligand is exposed
as part of a mixture. This avoids the need for time-consuming de-convolution of a hit
from the mixture. Here, three important steps are achieved simultaneously. The
calculated electron density function directly reveals the binding event, identifies the bound
compound and provides a detailed 3-D structure of the ligand-receptor complex. In one
embodiment, once a hit is found, one could screen a number of analogs or derivatives of
the hit for tighter binding or better biological activity by traditional screening methods.
Another embodiment uses the hit and information about the structure of the target to develop
analogs or derivatives with tighter binding or better biological activity. In yet another
embodiment, the ligand-receptor complex is exposed to additional iterations of potential
ligands so that two or more hits can be linked together to make a more potent ligand.

Brief Description of the Drawings

Figure 1 illustrates a structure-based drug design where an initial lead compound is
found, and then used as a scaffold to carry additional moieties that fit the subsites that
surround a major site.

Figure 2 illustrates a fragment linking approach for a biomolecule having two or
more adjacent primary pockets.

Figure 3 is an outline of CrystaLEAD™ wherein a crystal is soaked in a solution of 
various potential ligands (IrI,o) and an X-ray diffraction dataset is collected and 
transformed into an electron density map which is inspected for compound binding. 

Figure 4 illustrates a typical compound mixture in 2-D and 3-D. The 3-D figures 
are theoretical 2Fo-Fc electron density maps that represent the "shape" of the molecules. 

Figure 5 is a primary sequence of human urokinase. 

Figure 6 illustrates how a hit was detected and identified by shape after urokinase
was soaked in a solution containing a mixture of potential ligands. Figure 6A is the initial
Fo-Fc map. Figure 6B shows how the compound binds at the active site of urokinase.
Figure 6C illustrates the active site without a bound ligand when no compound of the
mixture has bound.

Figure 7 illustrates a hit for urokinase soaked in a solution containing a mixture of
potential ligands. Figure 7A is the initial Fo-Fc map. Figure 7B shows how the
compound binds at the active site of urokinase.

Figure 8 illustrates a hit for urokinase soaked in a solution containing a mixture of
potential ligands. Figure 8A is the initial Fo-Fc map. Figure 8B shows how the
compound binds at the active site of urokinase.

Figure 9 illustrates two additional hits for urokinase soaked in a solution
containing a mixture of potential ligands. Figure 9A is the Fo-Fc map for a strong ligand
within the mixture. Figure 9B is the Fo-Fc map for a weaker ligand within the mixture.
The weaker ligand was detected only after the strong ligand was removed from the
mixture.

Figure 10 illustrates the comparative crystal structures between a lead compound
found by CrystaLEAD™ and an optimized follow-up compound.

Figure 11 illustrates hits that were identified for VanX.

Figure 12 illustrates a hit for urokinase.

Figure 13 illustrates the crystal structure of compound 44 with ErmC.

Figure 14 illustrates the crystal structure of compound 45 with ErmC.

Detailed Description of the Invention

CrystaLEAD™ provides an efficient screening method for identifying compounds 
that will bind to a target biomolecule. Such compounds can serve as leads or scaffolds to 
design ligands and/or drugs that have improved biological activity for the target. One 
must note that tighter binding ligands do not necessarily provide better biological activity 
or make a better drug, although this is the general rule. It is possible for a weaker binding 
ligand to provide better biological activity due to factors other than tight binding (e.g.,
selectivity, bioavailability).

Crystallography has been used extensively to view receptor-ligand complexes for
structure-based drug design. To view such complexes, known ligands are usually soaked
into the target molecule crystal, followed by crystallography of the complex. Sometimes,
it is necessary to co-crystallize the ligands with the target molecule to obtain a suitable
crystal.

Until now, crystallography has not been implemented to screen potential ligands
despite the detailed structural information that it provides. Possible prejudices against
screening compounds by crystallography include the belief that the method is too
complicated or time consuming, that suitable crystals are difficult to obtain, that available
crystals could not tolerate soaking more than one compound (much less mixtures of ten or
more compounds), that too much biomolecule would be needed, that it would be too time
consuming to routinely mount crystals, and that constantly changing crystals on the X-ray
goniometer would be too tedious.

However, currently available technology has overcome many of these perceived
barriers. For example, at one time, molecular targets were only obtained from natural
sources and were sometimes unsuitable for crystallization due to natural degradation or
glycosylation. In addition, the natural concentration was often too low to obtain the
amount of highly purified protein necessary for crystallization. With molecular biology,
large amounts of protein may be expressed and purified for crystallization. When
necessary, the protein can even be re-engineered to provide different or better crystal
forms.

Further, brilliant light sources (synchrotron radiation) and more sensitive detectors
have become readily available so that the time required to collect data has been reduced
dramatically from days to hours or even minutes. Furthermore, existing technologies
which are less routine at this time, but may become routine soon, allow full data set
collections on the order of seconds or even fractions of a second (e.g., Laue diffraction). J.
Hajdu et al., Nature, v. 329, pp. 178-81 (1987). Faster computers and more automation
software have greatly decreased the time required for data collection and analysis. Finally,
the inventors have discovered that it is possible to soak or co-crystallize mixtures of
compounds to screen for potential ligands. Thus, as described below, crystallography is
now a practical and feasible screening method.

In CrystaLEAD™, ligands for a target molecule having a crystalline form are
identified by exposing a library of small molecules, either singly or in mixtures, to the
target (e.g., protein, nucleic acid, etc.). Then, one obtains crystallographic data to compare
the electron density map of the putative target-ligand complex with the electron density
map of the target biomolecule. The electron density map simultaneously provides direct
evidence of ligand binding, identification of the bound ligand, and the detailed 3-D
structure of the ligand-target complex. Binding may also be monitored by changes in
individual reflections within the crystallographic diffraction pattern which are known to be
sensitive to ligand binding at the active site. This could serve as a pre-screen but would
not be the primary method of choice because it provides less detailed structural
information.

By observing changes in the level of ligand electron density or the intensity of
certain reflections in the diffraction pattern as a function of ligand concentration, either
added to the crystal or in co-crystallization, one may also determine the binding affinities
of ligands for biomolecules. Binding affinities may also be obtained by competition
experiments. Here, the new compound(s) are soaked or co-crystallized with one of a
series of diversely-shaped ligands of known binding affinity. If the known ligand appears
in the electron density map, the unknown ligands are weaker binders. However, if one of
the new compounds is found to compete for the site, it would be the tightest binder. By
varying the concentration or identity of the known ligand, a binding constant for the
CrystaLEAD™ hit may be estimated.
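
As an illustration only, the dependence of occupancy on ligand concentration can be sketched with a single-site binding model. The sketch assumes that the free ligand concentration equals the soak concentration and that a refined occupancy is available for each soak; the function names and the titration values are hypothetical and are not taken from the disclosure.

```python
def occupancy(ligand_mM, kd_mM):
    """Fractional occupancy for single-site binding (free ligand ~ soak concentration)."""
    return ligand_mM / (ligand_mM + kd_mM)


def estimate_kd(ligand_mM, observed_occupancy):
    """Least-squares Kd (mM) from the linearised form 1/theta - 1 = Kd * (1/[L]).

    The fit is a line through the origin, so Kd = sum(x*y) / sum(x*x).
    """
    xs = [1.0 / c for c in ligand_mM]
    ys = [1.0 / th - 1.0 for th in observed_occupancy]
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)


# Hypothetical titration: occupancies simulated for a 1.5 mM binder.
concs = [0.5, 1.0, 2.0, 6.0]
occs = [occupancy(c, 1.5) for c in concs]
kd = estimate_kd(concs, occs)
print(f"estimated Kd = {kd:.2f} mM")
```

The double-reciprocal form weights the low-concentration soaks most heavily; with noisy occupancies a non-linear fit would likely be preferable.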

The number of compounds screened is based upon the desired detection limit, the
compound solubility and the amount of organic co-solvent the crystals will tolerate. Exact
numbers depend on each crystal. For example, for a typical crystal that tolerates 1%
organic co-solvent, the sensitivity limit would be Kd < 1.5 mM to screen 10 compounds
simultaneously. For 20 compounds, the sensitivity limit would be Kd < 0.63 mM.
However, crystals that tolerate high organic co-solvents (e.g., 40%) can screen up to 50
compounds within a detection limit of Kd < 1.5 mM.

In the most general application of CrystaLEAD™, the hit or lead compound is
used to determine what compounds should be tested for biological activity in
structure-based drug design. Then derivatives and analogs are obtained by traditional medicinal
chemistry to find the best ligand or drug.

Alternatively, the structural information collected in the screening process can be
used directly to suggest analogs or derivatives of the hit. This approach is illustrated when
the active site is composed of one primary pocket surrounded by a variety of subsites and
small pockets (Figure 1). Detailed structural information about how a compound is bound
by the receptor is obtained simultaneously as a hit is detected. Such information is useful
to the ordinary artisan for designing better ligands. P. Colman, Curr. Opin. in Struct.
Biology, v. 4, pp. 868-74 (1994); J. Greer et al., J. Med. Chem., v. 37, pp. 1035-54 (1994);
C. Verlinde et al., Structure, v. 15, pp. 577-87 (1994). In particular, the hit identifies sites
for analog synthesis which would permit access to the surrounding subsites and small
pockets. This suggests the design of new compounds which better fit the active site.
Furthermore, in cases where there is an existing structure-function relationship,
activity-enhancing substitution patterns may be directly transferred to the new lead scaffold at the
3-D structural level.

Another illustration (Figure 2) usually applies to a target that has two or more
separate pockets that will accommodate fragments. Here, the crystalline target is screened
for ligands that occupy all of the sites either in sequence or simultaneously. Because the
binding event is monitored by visualizing co-crystal structures, the site of ligand binding is
identified directly and there is no need for competition experiments to assure that the
ligands indeed occupy different sites on the protein. Screening separately allows for
ligands which bind to distinct pockets that overlap in their binding loci. Screening for the
second in the presence of the first would detect cooperative binding at a second site. Once
potential leads and a structure-activity relationship have been established, linkages
between each of the sites may be designed using the detailed structural information and
the fragment linking approach as previously described to produce novel, much more potent
ligands. S.B. Shuker et al., Science, v. 274, pp. 1531-34 (1996); C. Verlinde et al.,
Structure, v. 15, pp. 577-87 (1994).

In a third application, the scaffold merging approach (not shown), the target active
site is composed of two or more subsites. The crystalline protein is screened for ligand(s)
which bind via these subsites, and the relative ligand binding orientation is observed over
multiple experiments. These ligands should bind by occupying one or more subsites, and
by overlaying the structures of multiple hits a core may be designed that will facilitate
access to multiple subsites. This core would then serve as a novel and more potent
lead compound which would also serve as the lead scaffold in the drug-design cycle.

The CrystaLEAD™ linked-fragment approach experimentally implements the
structure-based linked-fragment approach reported only at the computational level by
Verlinde et al. in J. Comput. Aided Mol. Des., v. 6, pp. 131-47 (1992). Verlinde et al.
proposed ligand fragments based on mathematical calculations. The proposed fragments
were then assayed for binding activity. If the fragments actually bound, their 3-D
structures were determined by X-ray crystallography and a linker designed. By contrast,
CrystaLEAD™ concurrently detects the binding event and provides an experimentally
determined 3-D structure of the ligand-protein complex. The invention also provides for a
process of determining the association constant between a target molecule and its ligand.
The invention requires no special labeling of the target. Therefore, the target molecule
can encompass proteins, polypeptides, nucleic acids, nucleoproteins, or any other suitable
target molecule that is isolated from natural sources or by recombinant methods from any
suitable host system as developed and practiced by the ordinary artisan.

There are several advantages to crystallographic screening. One important
advantage is that the binding event is monitored directly so that the probability for false
positives is reduced to near zero. The crystallographic data provide a three-dimensional
electron density "snap-shot" of the ligand-receptor complex showing which compound
binds and how it is bound.

The method is uniquely sensitive to structural changes in both the target and the
ligand. Observing structural changes is critical in designing scaffolds which combine
information from different ligand-target complex structures. One such example occurs
when a protein changes structure in order to accommodate one ligand, but the structure
change concurrently blocks the binding of a second ligand. Similarly, detecting structural
changes is also important because if the primary scaffolds bind differently, it may not be
possible to combine them into a larger scaffold.

Since the binding event is monitored directly, CrystaLEAD™ does not require
specially labeled samples, probes or target molecules which would be indirectly sensitive
to ligand association. As long as one is able to obtain a crystal structure of the target, one
can use CrystaLEAD™ to screen for ligands.

If compound mixtures are suitably designed to be shape-diverse, the invention
alleviates the need for de-convolution of libraries which are soaked as a mixture because
the binding event is detected directly by examining the shape of the electron density at the
binding site. Thus, the shape of the electron density identifies both the binding event and
the compound identity directly. Alternatively, one can design the mixture to contain
compounds with anomalous scattering atoms (e.g., Br, S) that can be identified by
anomalous scattering techniques. Further, because CrystaLEAD™ directly monitors
binding, it is particularly well-suited for studying targets where no known ligands exist.

Because the electron density function calculated in CrystaLEAD™ shows the "real 
space" of the crystal, one can focus directly on the region of interest. Thus, binding may
be detected exclusively at the site of interest although the method is not limited to the 
active site. Binding at other sites, which complicates analysis in most binding assays, can 
be eliminated from consideration totally. 

CrystaLEAD™ also provides for a method of concurrently monitoring binding at 
different locations. That is, for a target with more than one pocket, screening for a second 
site does not require screening in the presence of the first ligand. However, screening for a 
second site may be completed in the presence of the first ligand in order to discover 
cooperative ligands. 

CrystaLEAD™ is applicable to any target molecule for which a crystal structure
can be obtained. According to current literature, this includes any soluble macromolecule
with molecular weight between about 5000 and 200,000. However, this range expands
almost daily in response to technological advances. The method is also sensitive to a wide
range of binding dissociation constants (<picomolar to molar). Using more sensitive CCD
camera detectors, data may be collected in about 4 hours or less with a rotating
anode source. This permits the screening of thousands of compounds per detector per day.
Using synchrotron sources, the number of compounds screened increases to multiple
thousands per detector, and with Laue data collection methods and testing mixtures,
CrystaLEAD™ data can be collected in a second or less, thus permitting thousands of
compounds to be tested per day per beamline. Hence, multiple detectors or a single
synchrotron beamline facilitates true high-throughput screening.

Figure 3 outlines the invention. Crystals of the target molecule are exposed to one
or more compounds by soaking the crystal or by co-crystallizing the target in the presence
of one or more compounds. Then crystallographic data are collected, processed and
converted to electron density maps which are examined for evidence of ligand binding.
One way to detect ligand binding is to compare the structure of the original crystal with the
structure of the exposed crystal.

New targets may be crystallized by published conditions or by other methods well 
established in the art. Similarly, target structures may be available from databases such as 
the Protein Data Bank or could be determined by well established methodology. 
Advances in molecular biology and protein engineering expedite target crystallization 
while advances in data collection aid in rapid structure determination for targets of
previously unknown structure.

Crystals that are exposed to potential ligands by soaking require an empty
accessible active site. Crystals with an empty active site may be obtained by various
methods, including but not limited to: crystallization in the absence of a ligand;
crystallization in the presence of ligand bound at a distal site; or crystallization in the
presence of a non-covalent ligand that is easily diluted or exchanged from the target once
the biomolecule crystallizes. By a novel method, the inventors have obtained crystals
from a biomolecule by crystallizing the biomolecule in the presence of a degradable ligand
at the active site and then degrading the ligand once a crystal is formed. Alternatively, it is
possible to grow the crystals in the presence of the compounds to be screened. Crystals
are allowed to equilibrate in the presence of the mixture, at which point the ligands bind as
a function of their concentration and binding affinity.

For the soaking method, the sensitivity of the method may be approximated by
simple equilibria relationships because the concentration of protein in the crystal may be
calculated and the concentration of ligand is a known quantity. For example, the
concentration of a 25,000 MW protein (urokinase) in a crystal is calculated as follows:
there are 4 molecules in the orthorhombic unit cell (all angles 90°) which has a volume of
55 x 53 x 82 Å³; using Avogadro's number, the concentration is 28 mM. Therefore, a
mixture of compounds having a 6 mM concentration for each ligand will result in a
calculated sensitivity limit of Kd < 1.5 mM (assuming a detection limit of about 80%
occupancy in the crystal).
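
The arithmetic in this example can be checked with a short sketch. It assumes a simple single-site equilibrium in which the free ligand concentration stays at the soak concentration (i.e., the soaking solution is not depleted by the crystal); the function names are illustrative only.

```python
AVOGADRO = 6.02214076e23           # molecules per mole
LITRES_PER_CUBIC_ANGSTROM = 1e-27  # 1 A^3 = 1e-30 m^3 = 1e-27 L


def crystal_concentration_mM(molecules_per_cell, a, b, c):
    """Protein concentration (mM) in an orthorhombic cell with edges a, b, c in angstroms."""
    volume_litres = a * b * c * LITRES_PER_CUBIC_ANGSTROM
    moles = molecules_per_cell / AVOGADRO
    return moles / volume_litres * 1000.0


def kd_limit_mM(ligand_mM, min_occupancy):
    """Weakest Kd (mM) that still reaches min_occupancy, from theta = [L] / ([L] + Kd)."""
    return ligand_mM * (1.0 - min_occupancy) / min_occupancy


protein_mM = crystal_concentration_mM(4, 55.0, 53.0, 82.0)
limit_mM = kd_limit_mM(6.0, 0.80)
print(f"protein in crystal: {protein_mM:.1f} mM; Kd limit: {limit_mM:.2f} mM")
```

Under these assumptions the cell holds roughly 27.8 mM protein, consistent with the 28 mM quoted above, and a 6 mM soak with an 80% occupancy threshold gives the stated limit of 1.5 mM.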

Soaking mixtures of compounds also raises the question of multiple occupancy 
(more than one ligand binding to the site of interest). For cases of multiple occupancy 
where the ligands are bound in different pockets (see Figure 2), resolution by 
CrystaLEAD™ is easy because the binding at the separate sites can be distinguished 
individually by the electron density maps. For the scenario where different ligands 
compete to occupy the same site, one may use a simple competitive inhibition model to 
calculate the requirements for such binding. From empirical observation, it is believed 
that crystallography can resolve situations where the occupancy of one inhibitor is 80%
and another 20%. Therefore, a ratio of binding affinity that is greater than four would
result in an apparent occupancy by only the higher-affinity ligand. In the unlikely case
where the ratio of binding constants of two compounds in the mixture is less than four,
the resulting electron density would be a weighted average of the two separate densities
and might be difficult to identify. Accordingly, it would be necessary to conduct further
soaking experiments to de-convolute the mixture (e.g., looking at each compound
individually in separate crystals) only where the ratio of binding affinities is less than four.
This would still be worthwhile and efficient because it already determines that at least two 
hits are present in the mixture. 
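
The relationship between a four-fold affinity ratio and an 80%/20% split can be illustrated with the standard competitive binding expression, theta_i = ([L_i]/Kd_i) / (1 + sum_j [L_j]/Kd_j). The sketch below is an assumption-laden illustration: it treats free ligand concentrations as equal to the soak concentrations, and the Kd values are hypothetical.

```python
def competitive_occupancies(conc_mM, kd_mM):
    """Fractional occupancy of a single site by each of several competing ligands."""
    terms = [c / k for c, k in zip(conc_mM, kd_mM)]
    denom = 1.0 + sum(terms)
    return [t / denom for t in terms]


def bound_shares(conc_mM, kd_mM):
    """Share of the occupied sites held by each ligand (shares sum to 1)."""
    occ = competitive_occupancies(conc_mM, kd_mM)
    total = sum(occ)
    return [o / total for o in occ]


# Two hypothetical ligands at 6 mM each whose Kd values differ four-fold.
shares = bound_shares([6.0, 6.0], [0.1, 0.4])
print([round(s, 2) for s in shares])
```

At equal concentrations the shares depend only on the Kd ratio, so a four-fold difference yields the 80%/20% split that the text treats as the resolvable limit.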

Compounds to be screened are formed into libraries. For the purposes of this
discussion, libraries are large mixtures of compounds (100-10,000+) and may be general
or structure-directed. A general library is random, i.e., fully diverse in size, shape and
functionality. A structure-directed library is aimed at a particular functional mixture or
subsite in the active site of the target molecule (e.g., a library where all compounds contain
a carboxylate functionality to be directed towards a positive charge in the target active
site).

In a preferred embodiment, either type of library is divided into smaller groups of
shape-diverse mixtures. Hence, a mixture is defined as a subset of the library which may
be soaked or grown into the crystal. The mixture is determined to be shape diverse by
visual inspection of the two-dimensional chemical structures or computationally by
programs. Shape diversity of the mixture permits a bound ligand to be identified directly
from the resultant electron density map (see Figure 4). This eliminates the need for
follow-up experiments to determine which compound of the mixture is a hit (bound to the
target).
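
The text does not specify how shape diversity is computed. One possible sketch, offered purely as an assumption, is a greedy partition in which each compound carries a numeric shape descriptor and joins the first mixture whose members all differ from it by at least a chosen distance. The descriptor tuples, the distance threshold and the function name below are all hypothetical.

```python
import math


def partition_shape_diverse(descriptors, mixture_size, min_distance):
    """Greedily group compounds so that members of one mixture differ in shape.

    descriptors: dict mapping compound name -> tuple of numbers
                 (a hypothetical shape descriptor, e.g. principal dimensions).
    Returns a list of mixtures, each a list of compound names.
    """
    mixtures = []
    for name, desc in descriptors.items():
        for mix in mixtures:
            has_room = len(mix) < mixture_size
            if has_room and all(
                math.dist(desc, descriptors[other]) >= min_distance for other in mix
            ):
                mix.append(name)
                break
        else:
            # No existing mixture can take this compound, so start a new one.
            mixtures.append([name])
    return mixtures


# Hypothetical library: A and B are alike in shape, as are C and D.
library = {"A": (1.0, 1.0), "B": (1.1, 1.0), "C": (5.0, 2.0), "D": (5.1, 2.1)}
mixtures = partition_shape_diverse(library, mixture_size=2, min_distance=1.0)
print(mixtures)
```

A greedy pass of this kind does not guarantee the fewest mixtures, but it keeps look-alike compounds out of the same soak, which is the property the electron density identification relies on.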

If the test compounds are water soluble, typical buffers and precipitant solutions
used in crystallization can be used to solubilize the mixtures and soak them into the
crystal. Less water soluble compounds are dissolved individually to a final concentration
of 2 M in a suitable organic solvent. In one embodiment, they are dissolved in 100%
DMSO and stored at 4 °C, and combined by mixing the DMSO stocks before exposure to the
crystal. These mixtures would serve most crystal systems where the conditions for
crystal growth do not include organic reagents. The compounds would typically be soaked
to a final DMSO concentration of 1-10% and allowed to equilibrate with the crystalline
protein for a pre-determined amount of time (4-24 hrs). Under this scenario, each crystal
is exposed to multiple compounds per soaking mixture. Some crystal growth conditions
can include a high concentration of organic solvents (40-50%), which are typically alcohol
derivatives. In this case, the compound libraries may be dissolved in the crystallization
organic solvent which would allow a final co-solvent concentration of 40-50% for the
soaking experiment. Here, the number of compounds per soaking mixture could increase.

After soaking, each crystal is exposed to a cryoprotectant such as 5-20% glycerol 
in the soaking mixture, mounted in a nylon loop and placed on the X-ray unit under a 
nitrogen cold stream (160 K). The crystal studies may also be performed at room
temperature or other suitable conditions as necessary for the stability of the crystals. 
Automated crystal mounting and changing equipment may be used to accelerate this step 
of the process. 

Crystallographic data are collected and processed where each reflection (spot) on
the diffraction pattern is assigned an index (h,k,l) and the intensity is measured as is standard
in the field. X-ray sources may be laboratory X-ray generators or high brilliance
synchrotron sources that permit diffraction data collection at very high speed.
Specifically, laboratory data collection may take from 30 minutes to several hours per
crystal while the time can be reduced using synchrotron sources. Data collection per
crystal could be reduced to fractions of a second using Laue data collection schemes.

The diffraction data are then converted to electron density maps by methods
familiar to the ordinary artisan. The electron density maps are the 3-D pictures of the
ligands and/or the target biomolecules.

For Fo-Fc maps, the calculated structure-factor amplitudes (|Fc|), which are
obtained from the known crystal structure with no ligand bound, are subtracted from the
observed amplitudes (|Fo|). Thus, this map represents a direct subtraction of the data
arising from the native protein structure from data arising from crystals soaked in the
presence of a library mixture. The result is an electron density map which has positive and
negative peaks. The peaks relevant to CrystaLEAD™ are the positive ones, which are the
direct result of ligand binding at the site of interest on the target, that is, the addition of
the ligand into the target biomolecule. In Figure 3, the Fo-Fc map clearly shows a large
positive peak at the active site of urokinase; the shape of the peak corresponds to the
ligand 2-amino-8-hydroxyquinoline. The ligand is shown occupying the positive
difference density. The other positive peaks correspond to a bound sulfate moiety
(indicated by SO4) and bound water molecules (indicated by H2O). This type of map is
also very sensitive to small structural changes (indicated by A) and, when used in
conjunction with 2Fo-Fc maps, allows determination of the detailed structure of the entire
ligand-protein complex. To calculate the 2Fo-Fc maps, one subtracts |Fc| from 2|Fo|.
Here, the map is positive and has density for all atoms of the molecule.
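
The two map types differ only in the amplitude term of each Fourier coefficient. The sketch below is a minimal illustration; it assumes, as is conventional in crystallography but not stated in the text, that the phases are taken from the ligand-free model, and the function names and sample values are hypothetical.

```python
import cmath
import math


def map_coefficients(f_obs, f_calc, phase_calc_deg, weight_obs=1):
    """Complex coefficients (weight_obs*|Fo| - |Fc|) * exp(i*phi_calc), one per reflection.

    weight_obs=1 gives Fo-Fc coefficients; weight_obs=2 gives 2Fo-Fc coefficients.
    """
    return [
        (weight_obs * fo - fc) * cmath.exp(1j * math.radians(phi))
        for fo, fc, phi in zip(f_obs, f_calc, phase_calc_deg)
    ]


def density_at(hkl, coeffs, xyz):
    """Unnormalised Fourier synthesis at fractional coordinate xyz.

    Only the real part of each term is summed, which for Friedel-paired data
    matches the full sum up to a scale factor.
    """
    total = 0.0
    for (h, k, l), f in zip(hkl, coeffs):
        angle = -2.0 * math.pi * (h * xyz[0] + k * xyz[1] + l * xyz[2])
        total += (f * cmath.exp(1j * angle)).real
    return total


# Hypothetical single reflection: observed amplitude exceeds the calculated one.
hkl = [(1, 0, 0)]
diff = map_coefficients([12.0], [10.0], [0.0])                 # Fo-Fc
full = map_coefficients([12.0], [10.0], [0.0], weight_obs=2)   # 2Fo-Fc
print(density_at(hkl, diff, (0.0, 0.0, 0.0)), density_at(hkl, diff, (0.5, 0.0, 0.0)))
```

In practice this step is handled by established crystallographic software over the full reflection set; the sketch only shows why excess observed amplitude appears as positive difference density.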

In Figure 3, inspection of the map indicates the identity and structure of the bound
compound. Preferably, the maps of exposed crystals are compared with the maps of the
unexposed target molecule to differentiate the positive density that may be found in the
Fo-Fc map. Sometimes water molecules occupy the active site in the crystal in the
absence of a bound ligand. This is easily differentiated because bound water molecules
are often oriented in a geometry consistent with hydrogen bonding and because they are
not connected by a network of covalent bonds. Thus, the resultant map tends to be
disconnected, indicating bound solvent rather than an organic compound. If the density in
the Fo-Fc or 2Fo-Fc map is determined to represent an organic compound, the
three-dimensional shape is compared to that of the compounds present in the library and a
best-fit match is made. Alternatively, programs such as the XFIT modules of QUANTA
(Molecular Simulations Inc., Quanta Generating and Displaying Molecules, San Diego:
Molecular Simulations Inc., 1997) can automate this process.

As the ability to measure or process diffraction intensities improves, one may not
need to perform the comparison on electron density maps. One may detect binding by
simply comparing the diffraction patterns of the exposed crystals with those of the
unexposed crystals. In that case, one needs to create an electron density map only if a
binding event is detected in this pre-screening process.
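
One way such a pre-screen might look is sketched below. The metric (a fractional isomorphous difference over the reflections common to both data sets) is an assumption chosen for illustration; the text does not specify how the diffraction patterns would be compared. The function name, the sample amplitudes, and the premise that both data sets are already on a common scale are likewise hypothetical.

```python
def isomorphous_difference(native, exposed):
    """Fractional amplitude difference between two data sets.

    native, exposed -- dicts mapping (h, k, l) to amplitude |F|,
                       assumed to be on the same scale already.
    Returns sum(| |F_exposed| - |F_native| |) / sum(|F_native|)
    over the reflections present in both data sets.
    """
    common = native.keys() & exposed.keys()
    if not common:
        raise ValueError("no reflections in common")
    numerator = sum(abs(exposed[hkl] - native[hkl]) for hkl in common)
    denominator = sum(native[hkl] for hkl in common)
    return numerator / denominator


# Hypothetical amplitudes for an unexposed and an exposed crystal.
native = {(1, 0, 0): 100.0, (0, 1, 0): 50.0, (0, 0, 1): 50.0}
exposed = {(1, 0, 0): 110.0, (0, 1, 0): 45.0, (1, 1, 1): 30.0}
print(isomorphous_difference(native, exposed))
```

A value well above the difference seen between two unexposed crystals would flag the soak for full map calculation; where to set that threshold would have to be established empirically for each crystal form.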

As shown above, CrystaLEAD™ can be applied to any biomolecular target for
which a crystallographic structure can be obtained. Because of its broad applicability, it is
best illustrated by the examples below.

The urokinase and VanX examples represent two scenarios for the use of
CrystaLEAD™. For urokinase, the re-engineered microUK (μUK) crystals diffract very
well and are of a high symmetry space group. By contrast, VanX crystals diffract more
weakly and with lower symmetry. Thus, VanX requires greater data collection time. In
addition, urokinase crystals have one molecule in the asymmetric unit, while VanX has
six. The larger asymmetric unit requires collection of higher resolution data and makes
map inspection more tedious. However, in the case of VanX, no non-substrate mimetic
binders were known before those discovered by CrystaLEAD™. Therefore,
CrystaLEAD™ provided a novel non-peptidic lead compound to be fed into the
drug-discovery cycle. For urokinase, CrystaLEAD™ provided a novel primary scaffold.
Applicants were able to rapidly increase the potency of the primary scaffold by using
existing SAR and crystal structures to design a higher-affinity derivative with improved
bioavailability over known urokinase ligands.

However, these examples illustrate the preferred embodiment of the present
invention, and do not limit the claims or the specification. The ordinary artisan will
readily appreciate that changes and modifications to the specified embodiments can be
made without departing from the scope and spirit of the invention. Finally, all citations
herein are incorporated by reference.

EXAMPLES

Example 1: Urokinase

Urokinase, a serine protease, is strongly associated with [...]. Urokinase
activates plasminogen into plasmin which, in turn, activates the matrix metalloproteases.
Plasmin and the metalloproteases degrade the extracellular matrix and promote tumor
growth and metastasis. Thus, inhibitors that specifically target urokinase may serve as
effective anti-cancer agents.

Human pro-urokinase consists of 411 amino acids (Figure 5). Verde et al., Proc.
Natl. Acad. Sci., v. 81(5), pp. 4727-31 (1984); Nagai et al., Gene, v. 36(1-2), pp. 183-8
(1985). When activated by proteolytic cleavage at the Lys158-Ile159 peptide bond, the
enzyme becomes two chains connected by a single disulfide bridge (Cys148-Cys279). The
A-chain (residues 1-158) contains an EGF-like domain and a kringle domain. The B-chain
(residues 159-411) contains the catalytic serine protease domain. Further incubation of
urokinase results in an additional proteolytic cleavage at the Lys135-Lys136 peptide bond to
form low-molecular-weight urokinase. Crystals of this enzyme form in complex with the
covalent inhibitor Glu-Gly-Arg chloromethyl ketone were obtained by Spraggon et al.,
Structure, v. 3, pp. 681-91 (1995), and were shown to diffract to 2.5Å resolution at a high
energy synchrotron source. However, the poor diffraction quality of these crystals, together
with the presence of a covalently bound inhibitor, makes application of CrystaLEAD™
difficult.



μUK Crystal Preparation & Structure

To implement CrystaLEAD™, human urokinase was re-engineered to consist only
of residues 159-404 of the B-chain, where Asn302 was replaced with a glutamine to remove
a glycosylation site and Cys279 was replaced with an alanine to remove the free sulfhydryl
moiety. This form of urokinase (μUK) was shown to be fully active and was found to
crystallize in a crystal form compatible with CrystaLEAD™. (See also U.S. Patent No.
5,112,755, issued May 12, 1992, to Heyneker et al.)

Preparing Vector Construct pBC-LMW-UK-Ala279

Mutants of human UK were cloned into a dicistronic bacterial expression vector
pBCFK12. Pilot-Matias et al., Gene, v. 128, pp. 219-25 (1993). The following
oligonucleotides were used to generate various UK mutants by PCR:

SEQ ID #   SEQUENCE OF PCR PRIMER

1   5'-ATTAATGTCGACTAAGGAGGTGATCTAATCTTAAAATTTCAGTGTGGCCAA-3'

2   5'-ATTAATAAGCTTTCAGAGGGCCAGGCCATTCTCTTCCTTGGTGTGACTCCTGATCCA-3'

3   5'-ATTAATTGCGCAGCCATCCCGGACTATACAGACCATCGCCCTGCCCT-3'

The initial cloning of a low molecular weight UK, hereinafter designated LMW-UK
(L144-L411), was performed using human UK cDNA as template and SEQ ID NOs: 1
and 2 as primers in a standard PCR reaction. The PCR amplified DNA was gel purified
and digested with restriction enzymes SalI and HindIII. The digested product then was
ligated into a pBCFK12 vector previously cut with the same two enzymes to generate
expression vector pBC-LMW-UK. The vector was transformed into DH5α cells (Life
Technologies, Gaithersburg, MD), isolated, and the sequence confirmed by DNA
sequencing. The production of LMW-UK in bacteria was analyzed by SDS-PAGE and
zymography, Granelli-Piperno et al., J. Exp. Med., v. 148, pp. 223-34 (1978), which
measures plasminogen activation by UK. That LMW-UK was expressed in E. coli, and
that it was active in the zymographic assay, was demonstrated by Coomassie blue stained
gel.

The success of the quick expression and detection of LMW-UK in E. coli made it
possible to perform mutagenesis analysis of UK in order to determine its minimum
functional structure. One mutant having a Cys279 to Ala279 replacement was made with
SEQ ID NOs: 2 and 3 by PCR. The PCR product was cut with AviII and HindIII, and used
to replace an AviII and HindIII fragment in the pBC-LMW-UK construct. The resulting
pBC-LMW-UK-Ala279 construct was expressed in E. coli and the product shown to be
active in zymography.

Cloning and Expressing μUK (UK-I159-K404 A279 Q302) in Baculovirus

μUK (UK amino acids Ile159-Lys404 that contain Ala279 and Gln302) was generated by
PCR with the following oligonucleotide primers:

SEQ ID #   SEQUENCE OF PCR PRIMER

4   5'-ATTAATCAGCTGCTCCGGATAGAGATAGTCGGTAGACTGCTCTTTT-3'
5   5'-ATTAATCAGCTGAAAATGACTGTTGTGA-3'
6   5'-ATTAATGTCGACTAAGGAGGTGATCTAATGTTAAAATTTCAGTGTGGCCAA-3'
7   5'-ATTAATGCTAGCCTCGAGCCACCATGAGAGCCCTGCT-3'
8   5'-ATTAATGCTAGCCTCGAGTCACTTGTTGTGAGTGCGGATCCA-3'
9   5'-GGTGGTGAATTCTCCCCCAATAATGCCTTTGGAGTCGCTCACGA-3'

To mutate the only glycosylation site (Asn302) in UK, oligonucleotide primers SEQ
ID NOs: 4 and 6, and SEQ ID NOs: 5 and 8, were used in two PCR reactions with
pBC-LMW-UK-Ala279 as the template. The two PCR products were cut with restriction
enzyme PvuII, ligated with T4 DNA ligase, and used as template to generate
LMW-UK-A279-Q302. In the meantime, native UK leader sequence was fused directly to
Ile159 by PCR with SEQ ID NOs: 7 and 9 using native UK cDNA as the template.

This PCR product was used as a primer, together with SEQ ID NO: 8, in a new
PCR reaction with LMW-UK-A279-Q302 DNA as template to generate μUK cDNA. μUK
was cut with NheI and ligated to a baculovirus transfer vector pJVP10z cut with the same
enzyme. Vialard et al., J. Virology, v. 64(1), pp. 37-50 (1990). The resulting construct,
pJVP10z-μUK, was confirmed by standard DNA sequencing techniques.

Construct pJVP10z-μUK was transfected into Sf9 cells by the calcium phosphate
precipitation method using the BaculoGold kit from PharMingen (San Diego, CA). Active
μUK was detected in the culture medium. A single recombinant virus expressing
μUK was plaque purified by standard methods, and a large stock of the virus was made.

Large scale expression of μUK was carried out in another line of insect cells, High-Five
cells (Invitrogen, Carlsbad, CA), growing in suspension in Excel 405 serum-free medium
(JRH Biosciences, Lenexa, KS) in 2 liter flasks, shaking at 80 rpm, 28°C. High-Five cells
were grown to 2 x 10^ cells/ml, recombinant μUK virus was added at 0.1 multiplicity of
infection, and the culture was continued for 3 days. The culture supernatant was harvested
as the starting material for purification. The activity of μUK in the culture supernatant,
measured by amidolysis of a chromogenic UK substrate S2444, was at 6-10
mg/liter. Claeson et al., Haemostasis, v. 7, p. 76 (1978).



Expressing μUK in Pichia pastoris

To express μUK in Pichia, an expression vector with a synthetic leader sequence
was used. The Pichia expression vector, pHil-D8, was constructed by modifying vector
pHil-D2 (Invitrogen) to include a synthetic leader sequence for secretion of a recombinant
protein. The leader sequence
5'-ATGTTCTCTCCAATTTTGTCCTTGGAAATTATTTTAGC[...]CTTCGCTCAGCCAGTTATCTGCAC[...]GATCC-3'
(SEQ ID NO: 10) encodes a PHO1 secretion signal (indicated by the single
underline) operatively linked to a pro-peptide sequence (indicated in bold) for KEX2
cleavage. To construct pHil-D8, PCR was performed using pHil-S1 (Invitrogen) as
template, since this vector contains the sequence encoding PHO1, with a forward primer (SEQ
ID NO: 11) corresponding to nucleotides 509-530 of pHil-S1 and a reverse primer (SEQ
ID NO: 12) having a nucleotide sequence which encodes the latter portion of the PHO1
secretion signal (nucleotides 45-66 of SEQ ID NO: 10) and the pro-peptide sequence
(nucleotides 67-108 of SEQ ID NO: 10). The primer sequences (obtained from Operon
Technologies, Inc., Alameda, CA) were as follows:



SEQ ID #   SEQUENCE OF PCR PRIMER

11   5'-GAAACTTCCAAAAGTCGCCATA-3'

12   5'-ATTAATGAATTCCTCGAGCGGTCCGGGATCCCTCGGCAGCGGAACCAACGGTAGTGCAG
     ATAACTGGCTGAGCGAAGACAGATTGCAAAGTA-3'

Amplification was performed under standard PCR conditions. The PCR product
(approximately 500 bp) was gel-purified, cut with BlpI and EcoRI and ligated to pHil-D2
cut with the same enzymes. The DNA was transformed into E. coli HB101 cells and
positive clones identified by restriction enzyme digestion and sequence analysis. One
clone having the proper sequence was designated as pHil-D8.

The following two oligonucleotide primers then were used to amplify μUK for
cloning into pHil-D8.



SEQ ID #   SEQUENCE OF PCR PRIMER

13   5'-ATTAATGGATCCTTGGACAAGAGGATTATTGGGGGAGAATTCACCA-3'

14   5'-ATTAATCTCGAGCGGTCCGTCACTTGGTGTGACTGCGAATCCAGGGT-3'

The PCR product was obtained with SEQ ID NOs: 13 and 14 using pJVP10z-μUK
as the template. The amplified product was cut with BamHI and XhoI and ligated to
pHil-D8 cut with the same two enzymes. The resulting plasmid, pHilD8-μUK, was confirmed
by DNA sequencing, and used to transform a Pichia strain GS115 (Invitrogen) according
to the supplier's instructions. Transformed Pichia colonies were screened for μUK
expression by growing in BMGY medium and expressing in BMMY medium as detailed
by the supplier (Invitrogen). The μUK activity was measured with chromogenic substrate
S2444. The μUK expression level in Pichia was higher than that seen in baculovirus-High
Five cells, ranging from 30-60 mg/L.



Purifying μUK

The culture supernatants of either High Five cells or Pichia were pooled into a 20 liter
container. Protease inhibitors iodoacetamide, benzamidine and EDTA were added to a
final concentration of about 10 mM, 5 mM and 1 mM, respectively. The supernatant was
then diluted 5-fold by adding 5 mM Hepes buffer pH 7.5 and put through 1.2 μ and 0.2 μ
filter membranes. The μUK was captured onto a Sartorius membrane adsorber S100
(Sartorius, Edgewood, NY) by passing through the membrane at a flow rate of 50-100
ml/min. After extensive washing with 10 mM Hepes buffer, pH 7.5, 10 mM
iodoacetamide, 5 mM benzamidine, 1 mM EDTA, μUK was eluted from the S100 membrane
with a NaCl gradient (20 mM to 500 mM, 200 ml) in 10 mM Hepes buffer, pH 7.5, 10 mM
iodoacetamide, 5 mM benzamidine, 1 mM EDTA. The eluate (~100 ml) was diluted 10
times in 10 mM Hepes buffer containing inhibitors, and loaded onto an S20 column (BioRad,
Hercules, CA). μUK was eluted with a 20x column volume NaCl gradient (20 mM to
500 mM). No inhibitors were used in the elution buffers. The eluate was then diluted
5-fold with 10 mM Hepes buffer, pH 7.5, and loaded onto a heparin-agarose (Sigma) column.
μUK was eluted with a NaCl gradient from 10 mM to 250 mM. The heparin column
eluate of μUK (~50 ml) was applied to a benzamidine-agarose (Sigma, St. Louis, MO)
column (40 ml) equilibrated with 10 mM Hepes buffer, pH 7.5, 200 mM NaCl. The
column was then washed with the equilibration buffer and eluted with 50 mM NaOAc, pH
4.5, 500 mM NaCl. The μUK eluate (~30 ml) was concentrated to 4 ml by ultrafiltration
and applied to a Sephadex G-75 column (2.5x48 cm; Pharmacia Biotech, Uppsala,
Sweden) equilibrated with 20 mM NaOAc, pH 4.5, 100 mM NaCl. The single major peak
containing μUK was collected and lyophilized as the final product. The purified material
appeared on SDS-PAGE as a single major band.

High-quality μUK crystals facilitated determination of its apo three-dimensional
structure by X-ray crystallography to 1.0Å resolution. Crystals were obtained by the
hanging drop vapor diffusion method. Typical well solutions consisted of 0.15M Li2SO4,
20% polyethylene glycol MW 4000 and succinate buffer pH 4.8-6.0. On the cover slip, 2
μl of well solution were mixed with 2 μl of protein solution and the slip sealed over the
well. Crystallization occurred at approximately 18-24°C within 24 hrs. The protein
solution contained 6 mg/ml (0.214mM) μUK in 10 mM citrate pH 4.0, 3 mM ε-amino
caproic acid p-carbethoxyphenyl ester chloride (inhibitor) with 1% DMSO co-solvent.
The inhibitor utilized in the co-crystallization is believed to acylate the active site serine
195 and to be subsequently deacylated enzymatically, because the 3-D X-ray structure of
crystals grown in the presence of this compound shows no inhibitor remaining in the
enzyme active site. Menegatti et al., J. Enzyme Inhibition, v. 2, pp. 249-59 (1989). The
only density present is that due to bound solvent molecules. Because μUK will not
crystallize in the absence of the inhibitor, the meta-stable inhibitor:μUK complex is
believed to be the crystallization entity. Importantly, the resultant μUK crystals are
composed of enzyme with an empty active site, which is the ideal case for implementation
of CrystaLEAD™.

Crystals obtained under these conditions belong to the space group P2₁2₁2₁ with
unit cell dimensions of a=55.16Å, b=53.00Å, c=82.30Å and α=β=γ=90°. They diffract to
beyond 1.5Å on a rotating anode source. Further, a 1.0Å resolution native data set has
been collected at the Cornell High Energy Synchrotron Source in Ithaca, New York. The
crystal structure was determined by the molecular replacement method using the AMORE
program, Navaza, J., Acta Cryst., A50:157-163 (1994), with the low-resolution urokinase
structure as the search probe, Spraggon et al., Structure, v. 3, pp. 681-691 (1995); PDB
entry 1LMW. The structure was refined using the XPLOR program package, A. Brunger,
X-PLOR (version 2.1) Manual, Yale University, New Haven CT (1990).

Screening for Weak Bases 

The μUK was screened against a structure-directed library in order to find a novel
primary scaffold which would have favorable pharmacokinetic properties. Since the
urokinase active site is composed of one primary pocket that contains a free carboxylate
moiety in the form of an aspartic acid (Asp189), most well-known scaffolds are strongly
basic and contain amidine or guanidine moieties. The basic group has been found to
form a hydrogen-bonded salt link with Asp189. This can be a problem pharmacologically since
strong bases are known to decrease oral bioavailability. Accordingly, a weakly basic
library containing compounds that were not previously known to be urokinase binders was
selected.

A weak base library containing 61 compounds with pKa between about 1 and 9
was located in the Available Chemicals Directory (ACD). The library was broken down
into 9 mixtures of about 6 to 7 shape-diverse compounds, as determined by visual
inspection of the two dimensional chemical structure. The compound mixtures were
screened by the method described above. Specifically, each compound was dissolved in
100% DMSO to a final concentration of about 2M (or saturation for the less soluble).
Equal volumes of each of the 6 or 7 compounds comprising the mixture were mixed to a
final individual compound concentration of 0.33M. Single μUK crystals were placed in
50μl of 27% PEG4000, 15.6mM succinate pH 5.4, 0.17M Li2SO4 and 0.5-0.8μl of the
compound mixture added to give 1 to 1.6% DMSO and 3.3 to 5.2mM final individual
compound concentration. Under these conditions the experiment is
expected to detect binders with Kd<1.0mM. Crystals were allowed to equilibrate for
about 8-24 hrs.
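
The dilution arithmetic in the soaking protocol above can be checked with a short calculation. This is a sketch only: the function name `soak_concentrations` and its parameter names are illustrative, and it assumes six compounds per mixture, each at the nominal 2M stock concentration, with both volumes expressed in the same unit.

```python
def soak_concentrations(stock_molar, n_compounds, drop_volume, added_volume):
    """Return (final concentration per compound in mM, final DMSO in %).

    stock_molar  -- concentration of each compound stock in DMSO (mol/l)
    n_compounds  -- number of stocks mixed in equal volumes
    drop_volume  -- volume of soaking solution holding the crystal
    added_volume -- volume of compound mixture added (same unit)
    """
    per_compound = stock_molar / n_compounds      # after equal-volume mixing
    total = drop_volume + added_volume
    final_mm = per_compound * added_volume / total * 1000.0
    dmso_percent = 100.0 * added_volume / total   # the mixture is 100% DMSO
    return final_mm, dmso_percent


# Lower and upper ends of the range given in the text (0.5 and 0.8 added to 50).
for added in (0.5, 0.8):
    conc, dmso = soak_concentrations(2.0, 6, 50.0, added)
    print(f"added {added}: {conc:.2f} mM per compound, {dmso:.2f}% DMSO")
```

With these inputs the calculation reproduces the stated ranges of roughly 3.3 to 5.2mM per compound and 1 to 1.6% DMSO. A seven-compound mixture would give proportionally lower individual concentrations.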

Data were collected on a Rigaku RTP 300 RC rotating anode source with a
RAXIS II or MAR image plate detector. Typical data consisted of 45-50 2° oscillations
with 2-5 min exposures. Typical data were 70-90% complete at 2.0-3.0Å resolution with
merging R-factors of 13-26%. Hence, the data quality ranged from fair to poor due to the
rapid data collection protocol. However, this quality of data was shown to be adequate for
the detection of binders, primarily due to the high quality of the starting model, which had
been refined to 1.5Å resolution (R=20.7%, Rfree=25.3%). Data were processed by the
DENZO program package, Otwinowski et al., Methods in Enzymology, 276 (1996), and
the electron density maps calculated by the XPLOR package.

Electron density maps were inspected on a Silicon Graphics INDIGO2 workstation
using the QUANTA 97 program package (Molecular Simulations Inc., Quanta Generating
and Displaying Molecules, San Diego: Molecular Simulations Inc., 1997). The shape of
the density at the active site was visually identified as resulting from one (or more) of the
compounds in the mixture, indicating a positive hit, or from ordered water molecules,
indicating the absence of binding. For experiments which resulted in a positive hit, the
appropriate compound was visually moved into the electron density. The electron density
maps were also checked for any changes in the protein structure and, if observed, the
appropriate modifications were made. Hence, after the map inspection/compound fitting
step, the three-dimensional structure of the compound:protein complex was known. The
urokinase example utilized visual movement of the compound into the density because the
screening was still on a small scale. When expanded to larger scale compound screening,
commercial programs such as the XFIT module of QUANTA will facilitate automatic
fitting of the compound to the density.




[Structures of screened compounds; recoverable labels: 4 (X-001), 5 (42753-71-9), 6 (42712-64-1)]



Figure 6 shows an example of a positive hit. The compounds screened are
numbered 1 through 6 and the Fo-Fc electron density map at the active site is shown in
Figure 6A. The shape of the density identified the binder as compound 5. Figure 6B
shows the detailed binding mode of the compound in the primary specificity pocket as
obtained directly by interpretation of the CrystaLEAD™ electron density map. The amino
nitrogen hydrogen bonds with the Asp189 carboxyl and the pyrimidyl nitrogen hydrogen
bonds with a backbone carbonyl (Gly[...]). The structure also shows that the ideal site for
modification would be at the pyridyl methyl.



51-78-5 2198-58-5 22013-33-8

Another mixture of compounds (compounds 7 through 13) did not produce any
hits. The electron density map obtained after soaking this group did not correspond to
any of the tested compounds in this mixture. Instead, the density corresponds to bound
solvent molecules. See Figure 6C.

14268-66-7
18




Figure 7 shows another example of a positive hit. Of the seven compounds
screened (14-20), the Fo-Fc map shown in Figure 7A indicates that compound 19 is
bound. The binding mode depicted in Figure 7B shows that the 2-amino is hydrogen
bonding with the Asp189 side chain and that the 8-hydroxyl is an ideal site for substitution
in order to access the adjacent hydrophobic sub-pocket (denoted as S1β in Figure 7B).
Figure 8 represents another hit where compound 22, 5-aminoindole, (Figure 8A) was
found to bind to urokinase with the amino group hydrogen bonding with Asp189 (Figure
8B). Compounds screened were compounds 21-27.

71026-66-9

Figure 9 shows an example where two compounds from the same mixture
(compounds 28-34) were found to bind without multiple occupancy problems. In the
initial experiment, where the crystal was soaked in the presence of the entire compound
mixture, compound 28 was found to bind (Figure 9A). In addition, when the weaker
binding compound 31 was soaked individually (based upon previous structure activity
relationships established through CrystaLEAD™), it was also found to bind (Figure 9B).
In a more typical application of the method, a library would be re-soaked in the absence of
the tighter binder in order to detect weaker binders in the mixture, if desired.

Table 1 summarizes the inhibition constants for each of the CrystaLEAD™ hits as
determined by pyroGlu-Gly-Arg-pNA/HCl (S-2444, Chromogenix) chromogenic activity.
Assays were completed at both pH 6.5 (0.1 M NaPO4) and 7.4 (50mM Tris). Other
conditions of the assay were 150mM NaCl, 0.5% Pluronic F-68 detergent, 200μM S-2444,
with a final DMSO concentration of 2.5%. The Km of the substrate was determined
to be 55μM.

Table 1: Inhibition constants and pKa for hits detected by CrystaLEAD™

Compound (CAS #)     Ki (pH 6.5)    Ki (pH 7.4)    pKa
5 (42753-71-9)       >>500μM        >>500μM        6.0*
19 (70125-16-5)      56μM           137μM          7.3
22 (65795-92-8)      200μM          >500μM         6.0*
28 (580-22-3)        71μM           136μM          7.3
31 (1603-41-4)       >>500μM        >>500μM        7.0*

* indicates estimated pKa

Based upon the activity and structural information, compound 19 was chosen as
the lead compound. Crystallographic information indicated that substitution at the
8-position should allow access to the adjacent hydrophobic pocket (S1β) and thereby
result in an increase in potency. Based upon crystallographic and binding information
from an amidine-based series, compound 35 was synthesized (the 8-aminopyrimidinyl
analog of compound 19). This modification resulted in about a 200 fold increase in
binding potency at pH 6.5 (Ki pH7.4=2.5μM; Ki pH6.5=0.32μM). The experiment
indicates that CrystaLEAD™ can provide both a lead scaffold and the detailed structural
information necessary to elaborate that scaffold through structure-based drug design into a
more potent compound.

[Structure of compound 35]

In Figure 10, an overlay of the crystal structures of compound 35:urokinase and the parent
compound 19 is shown. The overlay shows that the aminopyridine ring is bound in the
hydrophobic sub-pocket (S1β) as predicted and that this substitution results in
movement of the quinoline ring towards this site.

Compound 35, the 8-aminopyrimidinyl-2-aminoquinoline, was also tested for oral
bioavailability. Compound 35 was determined to be 30-40% orally bioavailable in the rat
when administered at a 10mg/kg dose. Hence, successful implementation of
CrystaLEAD™ resulted in a novel lead scaffold which, through one cycle of
structure-based drug design, produced a compound having a 200-fold increase in potency
that was found to be orally bioavailable.
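
The potency gain quoted above can be recomputed from the inhibition constants reported for compounds 19 and 35. The sketch below assumes all Ki values are in the same unit (micromolar is assumed here, since the unit symbols are inconsistent in the source); the function name `fold_improvement` is illustrative.

```python
def fold_improvement(ki_parent, ki_analog):
    """Ratio of parent Ki to analog Ki; larger means a more potent analog."""
    if ki_parent <= 0 or ki_analog <= 0:
        raise ValueError("Ki values must be positive")
    return ki_parent / ki_analog


# Ki values for compound 19 (parent) and compound 35 (analog), same unit assumed.
at_ph_6_5 = fold_improvement(56.0, 0.32)
at_ph_7_4 = fold_improvement(137.0, 2.5)
print(f"pH 6.5: {at_ph_6_5:.0f}-fold, pH 7.4: {at_ph_7_4:.0f}-fold")
```

Under that assumption the ratio at pH 6.5 is about 175, consistent with the "about 200 fold" figure in the text, while the ratio at pH 7.4 is about 55.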

Example 2: VanX 



Vancomycin is the drug of choice for infections caused by streptococcal or
staphylococcal bacterial strains that are resistant to β-lactam antibiotics. However, strains
of vancomycin resistant bacteria have now been found for this drug of last recourse. Some
investigators have associated VanX, a metalloproteinase, with vancomycin resistance.
VanX is part of a cascade that results in replacement of the terminal D-Ala-D-Ala moiety
of the bacterial peptidoglycan chain (the binding site for vancomycin) with a
D-Ala-D-lactate. This results in a 1000-fold decrease in vancomycin binding. The only known
inhibitors of VanX are peptides or peptide derivatives, such as phosphonate or phosphinate
analogs of the D-Ala-D-Ala substrate. As such, they are not suitable drugs because they
are metabolized and/or degraded in vivo. Initial attempts to find suitable drugs by normal
screening methods did not uncover a suitable ligand. Subsequently, Applicants turned to
CrystaLEAD™ to find a non-peptide lead compound for drug development towards a
treatment for these resistant strains.



VanX Preparation 

E. coli W3110 containing plasmid pGW1, in which the vanX gene is under control
of the IPTG-inducible tac promoter, was grown at 37°C in LB medium containing
ampicillin (100 μg/ml) to an absorbance of about 1.3-1.5 at 595 nm. IPTG was then
added to a final concentration of 0.8 mM, and the cells were grown for an additional 1.5
hours.

Cells were harvested by centrifugation at 6000 rpm for 10 min. Then, the pellet
was resuspended in ice cold 20 mM Tris-HCl (pH 8.0) containing 0.01% NaN3, 1 mM
MgCl2, 1 mM PMSF, 1 mM DTT (Buffer A) and 25 units/ml of benzonase (Nycomed
Pharma, Copenhagen, Denmark). The cells were lysed by the addition of 0.1 micron
zirconia ceramic beads to the lysate mixture (1:1 v:v) with a 1-3 minute run in a Bead
Beater (Biospec), an ultrasound bead mill. The Bead Beater was run with an ice-packed
reservoir to maintain a chilled lysate. Then, the lysate was decanted away from the settled
glass beads. The beads were then rinsed with 1-2 volumes of lysis buffer, and the washes
were then pooled with the original lysate. The lysate was centrifuged at 25000g for 30
minutes to settle cell debris. The supernatant was dialyzed overnight at 4°C in 50mM
Tris-HCl, pH 7.6, 1 mM EDTA, and 1 mM DTT (Buffer B).

Thereafter, the dialyzed lysate was loaded onto a Q-sepharose fast flow column,
pre-equilibrated in Buffer A, at a rate of four milliliters per minute. The column was exhaustively
washed with Buffer A followed by a linear gradient of Buffer B to Buffer B+0.5 M NaCl. The
active VanX fractions from this step were pooled, concentrated and then applied to a Superose-75
column in Buffer B. VanX fractions from the Superose column run were then applied to a Source-Q
column in Buffer A at a flow rate of 2 ml/min. The column was washed with starting buffer for
several column volumes. Then the VanX protein was eluted off with a shallow gradient of Buffer A
to Buffer A+25mM NaCl. The active VanX fractions from this final step were concentrated to a
final concentration of approximately 15 mg/ml in Buffer A with Amicon filters. Unless otherwise
specified, the foregoing procedure was run at 4°C. As purified, the VanX protein was
approximately 95% pure and readily crystallized.



VanX Crystal Structure

The crystal structure of VanX was determined at 2.2Å resolution by multiple
isomorphous replacement. Bussiere et al., Molecular Cell, Vol. 2, pp. 75-84 (1998). The
recombinant protein obtained above was crystallized in the space group P2₁ by the sitting
drop vapor diffusion method. Typical crystals had unit cell dimensions of a=83.4Å,
b=45.5Å, c=171.4Å, α=γ=90°, β=104° with six molecules in the asymmetric unit. Typical
well solutions consist of 0.1M Mes pH 6.4, 0.24M ammonium sulfate, and 20% PMME
5000. On the sitting drop microbridge (Hampton USA), 2μl of protein are mixed with
2μl of well solution and the chamber sealed with a cover slip. Crystallization occurs at
15°C, and the crystals grow to full size in about 2-3 days. The protein solution is
composed of 12-15mg/ml (0.5-0.6mM) VanX in 10mM Tris, 15mM DTT, pH 7.2. The
3-D structure for crystals grown under these conditions shows an empty active site, making
this a system highly suitable for application of CrystaLEAD™.

The VanX active site has an extended pocket capable of accommodating the D- 
Ala-D-Ala substrate. The pocket also contains a catalytic zinc. Thus, for this case, VanX 
was initially screened against zinc directed libraries in order to find multiple binding 
scaffolds which could be merged into a single lead compound. Three libraries utilizing 
amino-acid, thiol, hydroxamic acid or carboxylate moieties directed towards zinc were
screened. 



Screening 

The amino acid library consisted of 102 compounds of optically pure, commercially
available natural and non-naturally occurring amino acids. The library was divided into 12
mixtures of 8-10 shape-diverse compounds and screened by the method described above.
Specifically, each compound was dissolved in 100% DMSO to a final concentration of 2M
(or saturation for the less soluble). Equal volumes of each compound of each mixture
were mixed to a final individual compound concentration of 0.33M. Single VanX crystals
were placed in 50μl of 0.1M Mes pH 6.4, 0.24M ammonium sulfate, 20% PMME 5000
and 0.5-0.8μl of the compound mixture added to give 1 to 1.6% DMSO and 3.3 to 5.2mM
final individual compound concentration. Crystals were allowed to equilibrate for 3-4 hrs.
The thiol, hydroxamic and carboxylate libraries were prepared and screened in a similar
manner.

Data were collected on a Rigaku RTP 300 RC rotating anode source with a
RAXIS II, MAR image plate, or MAR CCD detector. For the image plate systems, typical
data consisted of 90 1.25° oscillations with 15min exposures while for the CCD 100 1.0°
oscillations were exposed for two minutes. Typical usable data were >90% complete at
2.6-2.8Å resolution with merging R-factors of 10-20%. This was required to adequately
visualize and identify inhibitors in the Fo-Fc or 2Fo-Fc maps. For these maps, the starting
model had been refined to 2.1Å resolution (R=25% Rfree=28%). Data were processed by
the DENZO program package and the electron density maps calculated by the XPLOR
package. In the presence of some compounds of the carboxylate library, the space group
was shown to shift from P2₁ to C2 (a=170.6Å, b=47.5Å, c=83.6Å, α=γ=90°, β=104°).
For this form, the asymmetric unit contained a trimer thereby reducing the number of
degrees of freedom so that lower resolution data (3.0Å) were adequate for visualization of
binding.
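To make the cost of each data set concrete, the sketch below totals the rotation range and exposure time for the two detector types. It assumes the image plate protocol reads as 90 oscillations of 1.25° at 15 minutes each and the CCD protocol as 100 oscillations of 1.0° at 2 minutes each, and it ignores detector readout and crystal handling time. The helper name `sweep_summary` is hypothetical.

```python
def sweep_summary(n_frames, osc_deg, exposure_min):
    """Return (total rotation in degrees, total exposure time in hours).

    Assumption: frames are contiguous, and only X-ray exposure time is
    counted (no readout, mounting, or alignment overhead).
    """
    total_rotation = n_frames * osc_deg
    total_hours = n_frames * exposure_min / 60.0
    return total_rotation, total_hours


image_plate = sweep_summary(90, 1.25, 15)   # image plate protocol
ccd = sweep_summary(100, 1.0, 2)            # CCD protocol

print(f"image plate: {image_plate[0]:.1f} deg in {image_plate[1]:.1f} h")
print(f"CCD:         {ccd[0]:.1f} deg in {ccd[1]:.1f} h")
```

On these assumptions an image plate data set takes most of a day of exposure, while a CCD data set takes a few hours.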

Electron density maps were inspected on a Silicon Graphics INDIGO2 workstation
using QUANTA 97. The shape of the density at the active site was visually identified by
the shape of one or more of the compounds in the mixture to indicate a positive hit or by
ordered water molecules indicating the absence of binding. For experiments which
resulted in a positive hit, the appropriate compound was visually moved into the electron
density. The electron density maps were also checked for any changes in the protein
structure, and if observed, corresponding modifications were made in the structure.
Hence, after the map inspection/compound-fitting step, the detailed 3-D structure of the
compound:protein complex was known.



Compounds 36-41 (structures not reproduced; the identifier printed with each structure is listed):

36: 580-22-3    37: 329-89-5    38: 132-32-1
39: 1603-41-4    40: 137-09-7    41: 24313-88-0

Currently 6 hits have been detected in the VanX screens (compounds 36-41).

Figure 11 shows the binding mode of representative hits. In all cases, the electron density
shape identified the binding compound. Figure 11A shows compound 39 bound with the
carboxylate coordinating to the active site zinc. Figure 11B shows compound 36 bound
with the carboxylate pointing towards the active site zinc. In Figure 11C, compound 37
was also found to bind through the carboxylate. The binding of compound 39 and
compound 41 (not shown) suggests that the active site zinc prefers coordination of a
carboxylate over a free thiol. This led to screening of a carboxylate library where
additional hits were found. In all cases, the compounds were screened in mixtures of 7-10
and the hit directly identified by the shape of the electron density map. These hits are fed
directly into the structure-based drug design cycle in a manner similar to that described for
the urokinase example.



Example 3: Screening with Mixtures of 100 Compounds

In order to increase the number of compounds that may be screened per unit time
by the CrystaLEAD™ method, a preferred embodiment of the method would be to screen
mixtures of 100 compounds rather than mixtures of 10. The advantage of this method is a
higher compound throughput with a concurrent lowering of the sensitivity of the hit
detection. In addition, since only the most potent compound in a mixture will bind,
weaker hits may be missed. When a general library, for example, one which is fully
diverse in size, shape and functionality, is screened by CrystaLEAD™, the hit-rate is
expected to be low. Therefore, a more coarse screen is warranted. In addition, since the
hits from this screen would be the more potent binders, they could serve as starting
scaffolds for structure-based drug design. Since the compound mixture will be composed
of 100 compounds, the mixture should be carefully designed in order to ensure that all
members would be diversely shaped enough to eliminate the need for deconvolution.
Hence, upon hit detection, some deconvolution may be necessary to identify the hit.

To test this particular method, a compound known to bind to µUK was added to a
group of 100 compounds. This known binder, compound 19, was originally discovered
by the CrystaLEAD™ method and shown to bind to µUK with a Ki of 56nM at pH 6.5
and 137µM at pH 7.4. The 100 compound mixture was constructed by mixing 10
mixtures of 10 compounds. Specifically, each dry mixture of 10 was dissolved in 100%
DMSO to a final concentration of about 80-240mM (or saturation for the less soluble).
Equal volumes of each of the mixtures of 10 compounds were mixed to a final individual
compound concentration of 8.0-24.0mM and the mixture spiked with a 100% DMSO
stock of compound 19 such that the final concentration was 18.0mM. Single µUK crystals
were placed in 50µl of 27% PEG4000, 15.6mM succinate pH 5.4, 0.17M Li2SO4 and
0.5µl of the compound mixture added to give 1% DMSO. The final concentration of each
compound in the soak experiment ranged from 80-240µM; the concentration of compound
19 was 180µM. Under these conditions the sensitivity of the experiment is expected to
detect binders with Kd<20-60µM. Crystals were allowed to equilibrate for 4 hours and 15
minutes.
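One way to read the stated sensitivity limit is through a simple single-site binding model. This model is an assumption made here for illustration; the text does not say how the limit was derived. If free ligand is taken to equal the soak concentration (ligand in large excess over protein, no competition between mixture members), the fraction of sites occupied is [L] / ([L] + Kd). The function name below is hypothetical.

```python
def fractional_occupancy(ligand_uM, kd_uM):
    """Fraction of binding sites occupied for a 1:1 complex.

    Assumptions: single-site equilibrium binding, free ligand concentration
    equal to the nominal soak concentration, no competing ligands.
    """
    return ligand_uM / (ligand_uM + kd_uM)


# Pair each end of the soak concentration range (80-240 µM) with the
# corresponding end of the quoted Kd limit (20-60 µM).
for ligand_uM, kd_uM in ((80.0, 20.0), (240.0, 60.0)):
    occ = fractional_occupancy(ligand_uM, kd_uM)
    print(f"[L] = {ligand_uM:.0f} µM, Kd = {kd_uM:.0f} µM -> occupancy {occ:.2f}")
```

Under this model both ends of the quoted range correspond to the same occupancy of about 0.8, so the Kd limit can be read as the point at which roughly four sites in five carry the ligand. Competition among the 100 compounds in a real soak would lower the occupancy of any one of them.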

Data were collected at the Argonne National Labs Advanced Photon Source
synchrotron ID beamline IMCA equipped with a MarCCD camera. Data consisted of 100
1° oscillations with 7 sec exposures. Data were 87.4% complete at 1.6Å resolution with
an overall merging R-factor of 5.4%. Data were processed by the DENZO program
package, Otwinowski et al., Methods in Enzymology, 276 (1996), and the electron density
map calculated by the XPLOR package.

The electron density map was inspected on a Silicon Graphics INDIGO2
workstation using the QUANTA 97 program package (Molecular Simulations Inc., Quanta
Generating and Displaying Molecules. San Diego: Molecular Simulations Inc., 1997). The
shape of the density at the active site was visually identified as resulting from one of the
compounds in the mixture, indicating a positive hit, which was identified as
compound 19 and is illustrated in Figure 12.

This method is preferable for discovering lead compounds. Lead compounds
would typically have the characteristics of being tighter binders (for example, within the
sensitivity range of the method). This method also allows screening of a 10,000
compound non-directed library on the timeframe of 1-2 weeks. This method would be
used in conjunction with the other methods of screening 10-20 compounds at a time where
weaker binders would be identified. These binders would be less likely to serve as lead
compounds, but could be attached to a lead scaffold in order to increase the potency.
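The throughput claim can be put in rough numbers with the sketch below. It assumes one crystal and one data set per mixture, no deconvolution runs, and the synchrotron protocol of this example (100 frames at 7 seconds each); crystal mounting, detector readout and map inspection are not counted. The function name `datasets_needed` is illustrative.

```python
import math


def datasets_needed(library_size, mixture_size):
    """Number of soaks/data sets if each mixture is screened on one crystal."""
    return math.ceil(library_size / mixture_size)


for mixture_size in (10, 100):
    n = datasets_needed(10_000, mixture_size)
    # Exposure time only: 100 frames x 7 s per data set (assumed protocol).
    exposure_hours = n * 100 * 7 / 3600.0
    print(f"mixtures of {mixture_size}: {n} data sets, "
          f"{exposure_hours:.1f} h of X-ray exposure")
```

On these assumptions a 10,000 compound library needs 100 data sets in mixtures of 100 against 1,000 in mixtures of 10, which is the tenfold gain in throughput that the coarser screen trades against sensitivity.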



Example 4: CrystaLEAD™ Screening of ErmC

ErmC is an rRNA methyltransferase that transfers a methyl group from
S-Adenosyl-L-methionine to N6 of adenine within the peptidyltransferase loop of 23S
rRNA. This methylation confers antibiotic resistance against a number of macrolide
antibiotics such as the widely prescribed erythromycin. Inhibition of ErmC would be
expected to reverse resistance. In order to design a specific and potent ErmC inhibitor,
the cofactor or S-Adenosyl-L-methionine binding site has been targeted.
S-Adenosyl-L-methionine is illustrated below as compound 42:

The crystal structure of ErmC shows that the S-Adenosyl-L-methionine site is
composed of two primary pockets which accommodate the adenine ring and the
methionine. In addition, there is a third pocket which may accommodate the rRNA
adenine that undergoes methylation. In order to establish an SAR at this site, a library of
adenosine analogues substituted at N6 and/or 5' hydroxyl was generated. The sites of
variation in the library are represented below as compound 43.




43

ErmC Expression and Purification

[illegible] was constructed by polymerase chain reaction
(PCR) amplification of the ermC' gene and the upstream kdsB cistron from pERM-1.
Subcloning the PCR product into pET24 (Novagen, Madison, WI) was performed using
BamHI and HindIII sites included in the "tailed" PCR primers. This new construct
allowed the expression of ErmC' by translational coupling to kdsB, under the control of
the T7lac promoter. pTERM31 plasmid was transformed into E. coli strain
BL21(DE3)/pLysS (Novagen) and the resulting [illegible].
Transformed cells were grown at 27.5°C in a New Brunswick Scientific (Edison, NJ)
Micros fermentor containing 10 l of Superbroth (BIO 101, La Jolla, CA), supplemented
with kanamycin, chloramphenicol, and glucose. When the culture optical density reached
1.10, ErmC' expression was induced by the addition of 1mM isopropyl-β-D-thiogalactopyranoside
(IPTG). Cells were harvested 400 minutes post-induction.

[illegible] 5-10
volumes of cold lysis buffer (50mM Tris, 5mM 1,4-dithiothreitol (DTT), 1mM
[illegible] acid, [illegible] 0.2%
Triton X-100, pH 7.8). The cells were lysed with a French press and cell debris removed by
[illegible] Tris-DTT-glycerol-magnesium
[illegible] applied to a Sepharose Fast Flow column (Pharmacia) that had been pre-equilibrated in TDGM
buffer. Fractions were assayed for methyltransferase activity and those containing ErmC' were
[illegible] NaCl gradient. The purified protein was then concentrated on a YM-10 (Amicon) membrane.

ErmC Crystal Structure

Crystals of ErmC' were grown by the hanging drop vapor diffusion method. Drops
containing 5-8 mg/ml ErmC' in 25mM Tris, 100mM NaCl, 2mM DTT, 10% (v/v)
[illegible] (NH4)2SO4, 15% PEG 8000, pH 7.8. Crystals appeared within one day and grew to their
full size within one week. Crystals belonged to the space group P4₃2₁2. The structure of
ErmC in this space group was determined by molecular replacement to 2.2 angstrom
resolution using the crystal structure of ErmC in the space group P6 (Bussiere et al.,
Biochemistry Vol. 37, pp 7103-7112). The 3-D structure for crystals grown under these
conditions shows an empty active site, making this a system highly suitable for application
of CrystaLEAD™.

Screening 

The adenosine library consisted of 59 compounds. The library was divided into 7
mixtures of 8-9 shape-diverse compounds and screened by the CrystaLEAD™ method.
Specifically, each compound was dissolved in 100% DMSO to a final concentration of 1M
(or saturation for the less soluble). Equal volumes of each compound were mixed to
assemble the mixture of 10. Single ErmC crystals were placed in 50µl of 20% PEG 8000,
0.3M ammonium sulfate, 10% glycerol, pH 7.7 and 0.5-0.8µl of the compound mixture
added to give 1 to 1.6% DMSO and 3.3 to 5.2µM final individual compound
concentration. Crystals were allowed to equilibrate for 3-4 hrs.

Data were collected on a Rigaku RTF 300 RC rotating anode source with a
RAXIS II, MAR image plate, or MAR CCD detector. For the image plate systems, typical
data consisted of 15-20 2° oscillations with 20-30min exposures while for the CCD 15-20
2.0° oscillations were exposed for 8-15 minutes. Typical usable data were 80-90%
complete at 3.4-3.6Å resolution with merging R-factors of 7-16%. This was required to
adequately visualize and identify inhibitors in the Fo-Fc or 2Fo-Fc maps. For these maps,
the starting model had been refined to 2.2Å resolution (R=22% Rfree=25%). Data were
processed by the DENZO program package and the electron density maps calculated by
the XPLOR package.

Electron density maps were inspected on a Silicon Graphics INDIGO2 workstation
using QUANTA 97. The shape of the density at the active site was visually identified by
the shape of one or more of the compounds in the mixture to indicate a positive hit or by
ordered water molecules indicating the absence of binding. For experiments which
resulted in a positive hit, the appropriate compound was visually moved into the electron
density. The electron density maps were also checked for any changes in the protein
structure, and if observed, corresponding modifications were made in the structure.
Hence, after the map inspection/compound-fitting step, the detailed 3-D structure of the
compound:protein complex was known.

Two hits were detected in the ErmC' adenosine analogue screen (compounds 44
and 45). Figures 13 and 14 show the crystal structures of the complexes of compounds 44
and 45 with ErmC.



(Structures of compounds 44 and 45 not reproduced.)



In all cases, the electron density shape identified the binding compound. The
hydrophobic substitution was found to bind along a partially exposed hydrophobic surface
suggesting a preferred interaction which may have contributed to the binding of these
compounds, allowing them to be pulled out as hits. No hits containing a substitution at the
5'-OH position were detected. A follow-up compound to compounds 44 and 45 contained
an optimized indane substituent at this hydrophobic site.



WE CLAIM:

1. A process for identifying a ligand to a target biomolecule comprising,

a) obtaining a target biomolecule crystal;

b) exposing the target biomolecule crystal to one or more test samples;
and

c) obtaining an X-ray crystal diffraction pattern to determine whether a
ligand/receptor complex is formed.

2. The process according to Claim 1 further comprising the steps of obtaining an X-ray
crystal diffraction pattern of the target biomolecule crystal prior to exposure to the
test samples and comparing the X-ray diffraction pattern of the target molecule
before and after the exposure.



3. The process according to Claim 1 further comprising the step of transforming
diffraction pattern into an electron density map.

4. The process according to Claim 3 further comprising the step of converting electron
density map into a structure.

5. The process according to Claim 1, wherein the target biomolecule is exposed to a
test sample by soaking the target biomolecule crystal in a solution that contains the
test sample.

6. The process according to Claim 1, wherein the target biomolecule is exposed to the
test samples by soaking the target biomolecule crystal in a solution containing a
mixture of test samples.

7. The process according to Claim 1, wherein the target biomolecule is exposed to the
test sample by co-crystallizing the target biomolecule crystal with a test sample.

8. The process according to Claim 1, wherein the target biomolecule is exposed to the
test samples by co-crystallizing the target biomolecule crystal with a mixture of test
samples.



9. The process according to Claim 6, wherein the mixture of test samples are diversely
shaped.

10. The process according to Claim 8, wherein the mixture of test samples are diversely
shaped.

11. The process according to Claim 1 wherein the ligand is a biologically-active moiety.

12. The process according to Claim 1, wherein the target is a polypeptide.

13. The process according to Claim 1, wherein the target is a re-engineered polypeptide.

14. A biologically-active moiety identified by the process according to Claim 11.

15. The process according to Claim 1 wherein said ligand is a lead compound. 

16. A process to design a ligand for a target biomolecule comprising, 

a) obtaining a target biomolecule crystal; 

b) identifying at least two ligands to the target biomolecule by X-ray
crystallographic screening;

c) determining the spatial orientation of the ligands when they are bound
to the target biomolecule; and

d) linking the ligands together according to the spatial orientation to form
the ligand.



17. The process according to Claim 16 wherein the spatial orientation of the bound 
ligands is determined by forming a multi-ligand/target molecule complex and 
generating an X-ray crystal structure of the multi-ligand/target molecule complex. 

18. The process according to Claim 16 wherein one ligand is bound to the target 
molecule before another ligand is bound to the target molecule. 

19. The process according to Claim 16 wherein the ligand is a biologically-active 
moiety. 

20. The process according to Claim 16, wherein the target is a polypeptide. 

21. The process according to Claim 16, wherein the target is a re-engineered polypeptide.

22. A biologically-active moiety designed by the process according to Claim 19. 

23. The process according to Claim 16 wherein said ligand is a lead compound. 

24. A process to design a ligand for a target biomolecule comprising,

a) obtaining a target biomolecule crystal;

b) identifying a ligand to the target biomolecule by X-ray crystallographic
screening;

c) making derivatives of the ligand.

25. The process according to Claim 24 wherein said ligand is a lead compound.



26. The process according to Claim 24 wherein the ligand is a biologically-active 
compound. 



27. The process according to Claim 24, wherein the target is a polypeptide. 

28. The process according to Claim 24, wherein the target is a re-engineered polypeptide. 

29. A lead compound identified by the process of Claim 25. 

30. A biologically-active compound designed by the process according to Claim 25. 

31. A biologically-active compound designed by the process according to Claim 26. 

32. A process to form a crystal having an easily accessible active site from a biomolecule 
comprising, 

a) co-crystallizing the biomolecule with a degradable ligand; and 

b) degrading the ligand once the crystal is formed. 



33. The process according to Claim 32 wherein the biomolecule active site degrades the 
ligand. 



WO 99/45379    PCT/US99/04967

34. The process according to Claim 32 further comprising adding degradation agents to 
degrade the ligand. 



35. The process according to Claim 32 wherein said ligand spontaneously degrades.

Figure 2
Figure 3
Figure 4
Figure 6
Figure 7
Figure 8
Figure 9
Figure 11
Figure 12
Figure 14

SEQUENCE LISTING

<110> Nienaber, Vicki
Greer, Jonathan
Abad-Zapatero, Celerino
Norbeck, Daniel

<120> LIGAND SCREENING AND DESIGN BY X-RAY
CRYSTALLOGRAPHY

<130> 6308.US.P1

<150> 09/036,184

<151> 1998-03-06

<160> 14

<170> FastSEQ for Windows Version 3.0

<210> 1
<211> 51
<212> DNA
<213> Synthetic

<400> 1

attaatgtcg actaaggagg tgatctaatg ttaaaatttc agtgtggcca a

<210> 2
<211> 57
<212> DNA
<213> Synthetic

<400> 2

attaataagc tttcagaggg ccaggccatt ctcttccttg gtgtgactcc tgatcca

<210> 3 
<211> 47 
<212> DNA 
<213> Synthetic 

<400> 3 

attaattgcg cagccatccc ggactataca gaccatcgcc ctgccct 
47

<210> 4
<211> 46
<212> DNA
<213> Synthetic

<400> 4

attaatcagc tgctccggat agagatagtc ggtagactgc tctttt 
46

<210> 5
<211> 28
<212> DNA
<213> Synthetic

<400> 5

attaatcagc tgaaaatgac tgttgtga
28

<210> 6
<211> 51
<212> DNA
<213> Synthetic

<400> 6

attaatgtcg actaaggagg tgatctaatg ttaaaatttc agtgtggcca a

<210> 7
<211> 37
<212> DNA
<213> Synthetic

<400> 7

attaatgcta gcctcgagcc accatgagag ccctgct

<210> 8 
<211> 42 
<212> DNA 
<213> Synthetic 

<400> 8 

attaatgcta gcctcgagtc acttgttgtg actgcggatc ca 
42 

<210> 9 
<211> 44 
<212> DNA 
<213> Synthetic 

<400> 9 

ggtggtgaat tctcccccaa taatgccttt ggagtcgctc acga 
44

<210> 10
<211> 111
<212> DNA
<213> Yeast Pichia pastoris

<400> 10

atgttctctc caattttgtc cttggaaatt attttagctt tggctacttt gcaatctgtc
60

ttcgctcagc cagttatctg cactaccgtt ggttccgctg ccgagggatc c

111

<210> 11 
<211> 22 
<212> DNA 
<213> Synthetic 

<400> 11 
gaaacttcca aaagtcgcca ta 
22 

<210> 12 
<211> 92 
<212> DNA 
<213> Synthetic 

<400> 12 

attaatgaat tcctcgagcg gtccgggatc cctcggcagc ggaaccaacg gtagtgcaga 
60

taactggctg agcgaagaca gattgcaaag ta 
92 

<210> 13 
<211> 46 
<212> DNA 
<213> Synthetic 

<400> 13 

attaatggat ccttggacaa gaggattatt gggggagaat tcacca 
46 

<210> 14 
<211> 47 
<212> DNA 
<213> Synthetic 

<400> 14 

attaatctcg agcggtccgt cacttggtgt gactgcgaat ccagggt